Data-driven modulation filter design under adverse acoustic conditions and using phonetic and syllabic units
Abstract
Constructing speech feature extraction methods that are robust to many types of corrupting acoustic environments remains a daunting task, so it is instructive to investigate which properties of speech carry the discriminative information for recognition under a variety of conditions. In this paper we describe results for generating RASTA-style modulation filters under a number of acoustic environments. We use Linear Discriminant Analysis, in the manner previously described by van Vuuren and Hermansky, to automatically generate discriminant filters for speech with artificially added background noise and reverberation. We also generate the filters using both phonetic and syllabic classification targets. Trends in the responses of the discriminant filters lend support to the feature extraction design decisions employed by RASTA-PLP and Modulation-filtered Spectrogram features. Further, tests with added reverberation corroborate views on the perceptual stability of syllabic rates.
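The data-driven filter design summarized in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' procedure: it assumes each training example is a short temporal trajectory of log energy from one frequency band, labeled with a phonetic or syllabic class, and it treats the leading LDA directions as FIR modulation filters. The function names, the 21-frame trajectory length used in the note below, and the 100 Hz frame rate are all hypothetical choices made for the example.

```python
import numpy as np


def lda_modulation_filters(trajectories, labels, n_filters=2, ridge=1e-6):
    """Return the top LDA eigenvalues and directions, read as FIR filters.

    trajectories: (n_examples, n_frames) band-energy trajectories (assumed input)
    labels:       (n_examples,) class label per trajectory (phone or syllable)
    """
    X = np.asarray(trajectories, dtype=float)
    y = np.asarray(labels)
    d = X.shape[1]
    grand_mean = X.mean(axis=0)

    S_w = np.zeros((d, d))  # within-class scatter
    S_b = np.zeros((d, d))  # between-class scatter
    for c in np.unique(y):
        Xc = X[y == c]
        class_mean = Xc.mean(axis=0)
        centered = Xc - class_mean
        S_w += centered.T @ centered
        diff = (class_mean - grand_mean)[:, None]
        S_b += len(Xc) * (diff @ diff.T)

    # Small ridge term (an assumption) keeps S_w positive definite.
    S_w += ridge * np.trace(S_w) / d * np.eye(d)

    # Solve S_b v = lambda S_w v by whitening with the Cholesky factor of S_w,
    # which turns it into an ordinary symmetric eigenproblem.
    L = np.linalg.cholesky(S_w)
    L_inv = np.linalg.inv(L)
    evals, U = np.linalg.eigh(L_inv @ S_b @ L_inv.T)

    order = np.argsort(evals)[::-1][:n_filters]
    filters = (L_inv.T @ U[:, order]).T  # one filter per row
    return evals[order], filters


def modulation_response(fir, frame_rate=100.0, n_fft=256):
    """Magnitude response of a filter over modulation frequency in Hz.

    frame_rate is the assumed number of feature frames per second.
    """
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / frame_rate)
    magnitude = np.abs(np.fft.rfft(fir, n=n_fft))
    return freqs, magnitude
```

Calling `lda_modulation_filters` on, say, 21-frame trajectories and passing each returned row to `modulation_response` gives a curve whose shape can be compared across clean, noisy, and reverberant training data. Repeating the analysis with phonetic versus syllabic labels only changes the `labels` argument.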
Similar resources
Syllable structure based phonetic units for context-dependent continuous Thai speech recognition
The choice of phonetic units for a speech recognizer greatly affects system performance. Phonetic units are normally defined according to the acoustic properties of speech. Nevertheless, when training data is limited, overly fine acoustic properties are ignored. Syllable structure is one of the properties usually ignored in English phonetic units due to a lot of possible onsets...
Dual-route phonetic encoding: some acoustic evidence
Contemporary psycholinguistic models suggest that there may be two possible routes in phonetic encoding: a 'direct' route which uses stored syllabic units, and an 'indirect' route which relies on the on-line assembly of sub-syllabic units. The computationally more efficient direct route is likely to be used for high frequency words, whereas the indirect route is most likely to be used for novel...
Constrained Subword Units for Speaker Recognition
Phonetic features have been proposed to overcome performance degradation in spectral speaker recognition in difficult acoustic conditions. The harmful effect of those conditions, however, is not restricted to spectral systems but also affects the performance of the open-loop phone recognisers on which phonetic systems are based. In automatic speech recognition, larger subword units and the use ...
Title of dissertation: SPEECH RECOGNITION BASED ON PHONETIC FEATURES AND ACOUSTIC LANDMARKS
Title of dissertation: SPEECH RECOGNITION BASED ON PHONETIC FEATURES AND ACOUSTIC LANDMARKS. Amit Juneja, Doctor of Philosophy, 2004. Dissertation directed by: Carol Espy-Wilson, Department of Electrical and Computer Engineering. A probabilistic and statistical framework is presented for automatic speech recognition based on a phonetic feature representation of speech sounds. In this acoustic-phone...
A probabilistic framework for landmark detection based on phonetic features for automatic speech recognition.
A probabilistic framework for a landmark-based approach to speech recognition is presented for obtaining multiple landmark sequences in continuous speech. The landmark detection module uses as input acoustic parameters (APs) that capture the acoustic correlates of some of the manner-based phonetic features. The landmarks include stop bursts, vowel onsets, syllabic peaks and dips, fricative onse...
Publication date: 1999